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Abstract 

This paper studies a new decentralized resource allocation strategy, named iterative spectrum shaping (ISS), for 
the multi-carrier-based multiuser communication system, where two coexisting users independently and sequentially 
update transmit power allocations over parallel subcarriers to maximize their individual transmit rates. Unlike the 
conventional iterative water-filling (IWF) algorithm that applies the single-user detection (SD) at each user's receiver 
by treating the interference from the other user as additional noise, the proposed ISS algorithm applies multiuser 
detection techniques to decode both the desired user's and interference user's messages if it is feasible, thus 
termed as opportunistic multiuser detection (OMD). Two encoding methods are considered for ISS: One is carrier 
independent encoding where independent codewords are modulated by different subcarriers for which different 
decoding methods can be applied; the other is carrier joint encoding where a single codeword is modulated by all 
the subcarriers for which a single decoder is applied. For each encoding method, this paper presents the associated 
optimal user power and rate allocation strategy at each iteration of transmit adaptation. It is shown that under 
many circumstances the proposed ISS algorithm employing OMD is able to achieve substantial throughput gains 
over the conventional IWF algorithm employing SD for decentralized spectrum sharing. Applications of ISS in 
cognitive radio communication systems are also discussed. 

Index Terms 

Spectrum sharing, interference channel, multi-carrier systems, decentralized resource allocation, multiuser 
detection, iterative water-filling, cognitive radio. 

I. Introduction 

This paper is concerned with spectrum sharing in a multiuser communication system based on multi- 
carrier modulation techniques such as discrete multitone (DMT) for wired-line communication and 
orthogonal frequency division multiplexing (OFDM) for wireless communication. It is assumed that 
neither the users' transmitters nor their receivers are collocated and as a result there is no centralized 
control over the users' transmissions. In addition, all users are assumed to transmit over the same frequency 
band and thus possibly interfere with each other. The above scenario exists in many wire-line/wireless 
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broadband communication systems in practice, e.g., the DMT-based digital subscriber line (DSL) network, 
and the OFDM-based wireless ad hoc network. 

The system of interest is in nature a competitive environment due to the lack of cooperation among 
the users. Therefore, decentralized strategies for allocation of users' transmit resources such as pow- 
ers, bit rates, bandwidths, and/or antenna beams become crucial to the achievable system throughput. 
Consequently, a great deal of valuable scholarly work has been done in the literature on this study. 
For the conventional narrow-band spectrum sharing over single- antenna slow-fading channels, distributed 
transmit power control has been studied in, e.g., [l]-[4], for minimizing the sum power consumption 
to meet with each individual user's quality-of- service (QoS) requirement. Following the similar problem 
formulation, decentralized joint power control and beamforming have been studied in, e.g., [5]-[7] for 
the case of multi-antenna transceivers. In [8], a decentralized power allocation strategy so-called iterative 
water-filling (IWF) was proposed for a 2-user DSL system, where each of the two users independently 
and sequentially updates transmit power levels over different subcarriers so as to maximize individual 
transmit rate, subject to the coexisting user's interference treated as additional background noise at the 
receiver. Because of its practical advantages for implementation, the IWF algorithm has been thoroughly 
investigated in the subsequent literature. For example, in [9], [10], IWF has been studied for spectrum 
sharing scenarios with more than two users. In [1 1], [12], conditions on the convergence of IWF have been 
rigourously characterized. Motivated by IWF, semi-centralized and centralized power allocation schemes 
for multiuser spectrum sharing have also been studied in [13]— [15] and [16]— [18], respectively, all based 
on the primal-dual Lagrange duality approach. 

The existing works on decentralized/centralized resource allocation schemes for multiuser spectrum 
sharing [1]— [1 8] have mostly assumed the single-user detection (SD) at the receiver by treating the 
interference from the other coexisting users as additional noise, mainly because of implementation ease 
of the proposed schemes. During the past decade, multiuser detection techniques (see, e.g., [19] and 
references therein) have been thoroughly studied in the literature, and proved under many circumstances 
to be able to provide substantial performance gains such as rate improvement and decoding error reduction 
over the conventional SD. This fact motivates this paper to make an attempt to combine the well-known 
IWF with multiuser detection such that at each iteration of user transmit adaptation, the corresponding 
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user is able to decode both the desired message and some/all of the interference users' messages - 
thereby reducing the overall interference at the receiver - if such decoding is feasible, thus termed as 
opportunistic multiuser detection (OMD). The resultant new decentralized resource allocation algorithm 
is named iterative spectrum sharing (ISS). Note that the proposed ISS maintains the main advantage 
of IWF to be a purely decentralized algorithm, while it improves over IWF via replacing the SD by 
the more advanced OMD. With OMD, the transmission of the updating user at each iteration subject 
to concurrent transmissions of the other coexisting users can be generally modeled by the Gaussian 
multiple-access channel (MAC) [20], whereas there is a key difference pointed out as follows. Unlike 
the conventional MAC, the coexisting users considered in this paper are non-cooperative in allocating 
transmit rates/powers over subcarriers due to the lack of centralized control over their transmissions. As 
a result, whether OMD should be applied and over which subset of users it should be applied depend 
on the instantaneous channel gains as well as the interference users' power and rate allocations. Note 
that the OMD in the context of this paper is analogous to the "successive group decoder (SGD)" in the 
fading MAC with unknown channel state information (CSI) at the user transmitters (see, e.g., [21] and 
references therein). The main contributions of this paper are summarized as follows: 

• This paper considers two encoding methods for the proposed ISS. One is carrier joint encoding 
(CJE) where a single codeword is modulated by all the subcarriers and is decoded at the receiver 
by a single decoder. The other encoding method is designed to maximally exploit the advantage of 
OMD, named carrier independent encoding (CIE), where independent codewords are modulated by 
different subcarriers and thus allow for variable rate assignments and adaptive decoding methods. For 
both encoding methods, this paper derives the optimal user power allocation strategies to maximize 
individual transmit rate at each iteration. The derived power allocation schemes are shown to be 
non-trivial extensions of the standard "water-filling" (WF) power control [20] for IWF 

• This paper investigates the converged user power spectrums by the proposed ISS, and compares them 
to those by IWF for various system setups. Such comparison reveals some important insights on 
why ISS is able to outperform IWF in terms of the achievable system throughput for decentralized 
spectrum sharing. 

The rest of this paper is organized as follows. Section |II] presents the system model of multi-carrier- 
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based multiuser spectrum sharing. Section [III] provides the problem formulations to determine the optimal 
user power allocation policies for the proposed ISS with CTE and CJE. Section [IV] presents the solutions 
to the formulated problems. Section [V] provides the simulation results to demonstrate the performance 
gains of ISS over IWF. Finally, Section [VI] concludes the paper. 

II. System Model 

Consider a typical spectrum sharing scenario where K users transmit independent messages to their 
corresponding receivers simultaneously over the same frequency band. For the purpose of exposition, in 
this paper it is assumed that K = 2, while the general case of K > 2 is to be studied in the future work. 
Both the users are assumed to adopt a multi-carrier (DMT/OFDM) -based transmission and have the 
same symbol period and cyclic prefix (CP) period that is assumed to be larger than the maximal signal 
multipath spread of the two users. The total bandwidth for spectrum sharing is equally divided into N 
orthogonal sub-channels. For the time being, it is assumed that perfect time and frequency synchronization 
with reference to a common clock system have been established for both the users prior to their data 
transmission. In addition, it is assumed that the difference between the propagation delays from the two 
user transmitters to either one of their receivers is much smaller than the CP period and, thus, such delay 
differences can be safely accommodated within the CP period. Consider a block-based transmission for 
the two users with each block consisting of L DMT/OFDM symbols, while L is usually a large number to 
guarantee sufficient coding protection within each block transmission. For typical wireless applications, 
it is also assumed that the block duration is sufficiently small as compared to the coherence time of any 
channel between the users. Thus, all the channels involved in this paper can be assumed to be block 
fading (BF), i.e., they are constant during each block transmission but can vary from block to block. 
Based on the standard DMT/OFDM modulation and demodulation, the discrete-time baseband signals 
for the system of interest are given by 

2/1, n = 

V2,n = h,22,nX2,n + /ll2,n#l,n + ^2,n (1) 

where n — 1, . . . , N is the subcarrier index; x i>n and y ijTl are the transmitted signal and received signal at 
subcarrier n, respectively, for user i — 1,2; hn,n an d ^22,n are me "direct" channel complex coefficients 
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for user 1 and 2, respectively, at subcarrier n, while h?,i,n an d ^i2,n are the "interference" channel complex 
coefficients from user 2 to 1, and from user 1 to 2, respectively, at subcarrier n; and z iyU is the receiver 
noise at subcarrier n for user i = 1,2. Note that both the block and symbol indexes are dropped in 
(OQ) for conciseness. Without loss of generality, it is assumed that {^ )n },Vi,n are independent circularly 
symmetric complex Gaussian (CSCG) random variables (RVs) each having zero mean and unit variance. 
It is also assumed that x i n 's are independent RVs each with zero mean and respective variance p iyU , while 
Pi >n denotes the transmit power allocated to subcarrier n of user i. Let Pi and P2 denote the average 
transmit power constraint for user 1 and 2, respectively. It thus holds that J2n=iPh™ — * = ^ ^- 

Two encoding methods are considered at each user transmitter. One is carrier independent encoding 
(CIE), where each subcarrier is assigned an independent codebook and from each codebook a codeword 
is chosen to be modulated into L consecutive DMT/OFDM symbols at the corresponding subcarrier in 
each block. At the receiver, N independent decoders are used to decode the corresponding messages 
from different subcarriers. Let r^ n denote the rate of the codebook assigned to user i at subcarrier n. 
The average transmit rate of user i then becomes Rf IE = J2n=i r i,n- The other encoding method is 
carrier joint encoding (CJE), where a single codebook is used for each block transmission and only 
one codeword is chosen from this codebook and is modulated into all N subcarriers of L DMT/OFDM 
symbols. At the receiver, a single decoder is used to decode the message from all the subcarriers. Let 
Rf JE denote the rate of this single codebook for user i. Comparing CIE and CJE, it is easily seen that CIE 
requires more encoding and decoding complexities over CJE, due to the use of independent codebooks 
over different subcarriers. In addition, for the same finite value of L, the effective codeword length for 
CIE is reduced by a factor 1/N as compared to that for CJE, thus resulting in inferior error-correcting 
capabilities. Therefore, the existing multi-carrier-based transmission systems in practice have all chosen 
to use CJE instead of CIE. Nevertheless, it is worth noticing that CIE provides more flexibility over CJE 
in adaptive rate assignments and decoding methods over subcarriers, which, as will be shown later in 
this paper, can be a beneficial factor for the proposed ISS under certain circumstances. 

The system model considered in this paper is known as the 2-user parallel Gaussian interference 
channel, for which characterization of the capacity region is in general still an unsolved problem (see, 
e.g., [22] and references therein). Nevertheless, achievable rates of this channel have been thoroughly 



studied in the literature based on different assumptions on the level of cooperations between the users 
for encoding and decoding as well as power and rate allocations over the subcarriers. In this work, we 
constrain our study on this channel by making the following major assumptions: 

• Each of the two users only has the knowledge on its own channel as well as the channel from the 
other user's transmitter to its receiver. 

• Each of the two users independently and sequentially updates its transmit power allocations over 
different subcarriers to maximize individual transmit rate. 

• Each of the two users is able to obtain the knowledge on transmit rates/rate (for CIE/CJE) of the 
other user over subcarriers; and both the users employ the same type of encoding method (CIE or 
CJE) and the same set of codebooks. Thereby, at one user's receiver, it is possible to apply multiuser 
detection (MD) to decode both the desired user's message and the interference user's message. 

Note that in the above assumptions, the first two are due to practical considerations and are same as 
those made by the conventional IWF proposed in [8]HJ while the third assumption is a new one and is not 
present in IWF where only the single-user detection (SD) is applied. The decentralized resource allocation 
scheme motivated by IWF while employing the more advanced MD is named iterative spectrum sharing 
(ISS) in this paper. 

III. Problem Formulation 

In this section, problem formulations are provided for the users to determine their transmit power and 
rate allocations over different subcarriers at each iteration of transmit adaptation. Both encoding methods, 
namely, CIE and CJE, are considered. For brevity, only user l's transmit adaptation is addressed here, 
while the developed results also apply to user 2. 

Consider first CIE. At a particular iteration for user 1 to update its transmission, since user 2's transmit 
powers {p2,n} and rates {r2, n } over different subcarriers are fixed values, the maximum transmit rate of 
user 1 at subcarrier n with an arbitrary allocated transmit power p ljH can be expressed asP 

'More precisely, in the first assumption on the known interference channel between the users, only the channel gain is to be known for 
SD of IWF while both the channel gain and phase information are required for MD of the proposed scheme. 

2 For the purpose of exposition, continuous rate and power values are assumed in this paper. In addition, it is assumed that the optimal 
Gaussian codebook is employed by the two users. The developed results in this paper are readily extended to the more practical cases 
with discrete power and rate values and/or non-optimal modulation and coding schemes via, e.g., applying the optimal discrete bit-loading 
algorithm with the "SNR gap" approximation [23]. 
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where C(x) = log 2 (l + x) is the capacity function of the AWGN channel [20], while h u >n = |/iii >n | 2 and 
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^2i,n — |^2i,n| 2 - The above result is illustrated in the following three cases corresponding to the three 
expressions of r 1 n in © from top to bottom. Note that the following discussions apply to any subcarrier 
n of user 1. 

• Strong Interference: In this case, the received interference signal power from user 2 at user l's 
receiver is sufficiently large such that the contained message with rate r 2in can be first decoded by 
SD with user l's signal taken as additional Gaussian noise. After that, by reconstructing the received 
user 2's signal and subtracting it from user l's message can be decoded by SD. The above 
operation is known as successive decoding in the MAC [20]. 

• Moderate Interference: In this case, the received signal power from user 2 is not as large as that 
in the previous case of strong interference and as a result, user 2's message can not be directly 
decoded by SD. However, it is still feasible for user 1 to apply joint decoding [20] to decode both 
users' messages^ In this case, the rate pair of the two users falls on the 45-degree segment of the 
corresponding MAC capacity region boundary [20]. 

• Weak Interference: In this case, the received signal power from user 2 is too weak to be decoded 
even without the presence of user l's signal. As such, user l's receiver has the only option of treating 
user 2's signal as the additional Gaussian noise and applying SD to decode directly user l's message. 
Note that the above SD is used in the conventional IWF regardless of the received signal power 
from the interference user (user 2). 

From the above discussions, it is known that MD is applied in both cases of strong and moderate 
interferences, but not in the case of weak interference. Thus, user l's receiver opportunistically applies 
MD to the interference user signal if it has a sufficiently large received power to be decoded either 
successively or jointly with the desired user signal. Therefore, the MD in the context of this paper is 
called opportunistic multiuser detection (OMD). 

3 Note that an alternative decoding method in this case is successive decoding along with "rate splitting" [24] or "time sharing" [20] 
encoding technique. However, these techniques require certain cooperation between the users and are thus not considered in this paper. 



In Fig. Q] (a), in © is illustrated. For conciseness, the index n is dropped here. It is assumed that 

p 2 = 1, r 2 = 0.5, and h 2 \ = h n = 1. Note that in this case r 2 < C(h 21 p 2 ) and thus OMD instead of SD 
should be applied. The rate achievable by SD, denoted by rf D (pi) = C( 1+ ^^ 2 ) from ©, is also shown 
for comparison. It is observed that user l's rate with OMD is improved over that with SD, and ri(pi) 
is the minimum of the two functions defined as f(pi) = C(hupi) and h(pi) = C(hnpi + h 2 \p 2 ) — r 2 , 
which are the rates achievable by successive decoding and joint decoding, respectively. The threshold 
value of px, denoted by p t h, for which ri(pi) = f(pi) if p\ < pth and otherwise ri(pi) = h(pi), is 
obtained from © as 

1 / h 21 p 2 \ 
Pth = i—\ ^- 7 - 1 • (3) 
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Note that p th > if r 2 < C(h 21 p 2 ). 

With ri , n (pi,n) given in © for all n's, the problem can be formulated for user 1 to optimize its power 
and rate allocations over subcarriers to maximize its average rate in the case of CIE. This problem is 
denoted as (PI) and is expressed as 

1 N 

(PI) max i?? E ({pi,„}) := - ^ r 1>n (pi, n ) 

n=l 

1 N 

n=l 

After (PI) is solved, from the obtained solution for pi >n at subcarrier n, the corresponding transmit rate 
and decoding method can be obtained from ©. The solution of (PI) is given later in Section HV-AI 

Next, the case of CJE is considered. Recall that R 2 JE and {p2,n} are user 2's transmit rate value and 
power allocations over subcarriers, respectively, which are all fixed for user l's transmit optimization. 
With joint encoding over all the subcarriers, the maximum transmit rate of user 1 under arbitrary power 
allocations {pi tn } is expressed as 

[ E[c(/n 1>nPl , B )] ^ JE <nc(j^StJ 

R^ E ({Pl,n}) = < E[C(V«Pl,» + h21,nP2,n)} ~ ^ ^^( ^^"J ] < R? E < E[C (h 21 , n p 2>n )] 

( nci^^t)] r?* > nc(h 21 , nP2 , n )] 

(4) 

where for notational brevity, E[-] is used to represent the operation ^ n=1 (-)- Note that the rate R® JE 
here is analogous to the ergodic capacity in wireless fading channels where a sufficient long codeword 
spans over all possible fading states and the codeword rate is the average of all the instantaneous mutual 



information of the channel at different fading states [25]. Similar to CIE, the three rate expressions of 
Ri JE in © are also achievable by successive decoding, joint decoding, and SD, respectively, whereas 
there is a key difference that only one of these decoding methods is applied over all the subcarriers for 
CJE, in contrast to the case of CIE, where each subcarrier can be independently assigned one of these 
decoding methodsjj Thus, unlike CIE, the user in the case of CJE does not have the flexibility for transmit 
rate and decoding method adaptations over different subcarriers, while it still can optimize over transmit 
power allocations and choose the best decoding method to maximize its transmit rate. 

The problem for user 1 to optimize its power allocations in the case of CJE is denoted as (P2), and is 
expressed as 

(P2) max R? E {{pi, n }) 

Pl,n>0,Vn 

1 N 

n=l 

After solving the optimal power allocations in (P2), the maximum transmit rate and its achievable decoding 
method can be obtained from ©. The solution of (P2) is provided later in Section HV-Bl 

IV. Optimal Power Allocation 

In this section, (PI) and (P2) for the case of CIE and CJE, respectively, are solved to obtain the optimal 
power allocations for user 1 at each iteration of transmit adaptation. It is shown that the obtained power 
allocation solutions in both cases are non-trivial variations of the standard WF solution [20], which is 
employed in IWF. 

A. Carrier Independent Encoding 

In this part, (PI) for the case of CIE is studied. The objective function of (PI) is the sum of N 
independent functions, ri n (pi n )'s, each of which can be easily shown to be a concave function of pi iTl . 
Therefore, the objective function is concave in {pi jn }. In addition, the constraint of (PI) is a linear function 
of pi in 's. Thus, (PI) is a convex optimization problem, and thus can be solved via convex optimization 
techniques. 

4 Due to frequency-selective channel variation, it may be possible that at some subcarriers of user 1, the interference channel gains from 
user 2 are sufficiently large such that if CIE is used, OMD can be applied at these subcarriers to immediately remove the effect of these 
interferences, while in the case of CJE, whether OMD can be applied depends on the interference channel gains at all the subcarriers. 
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In (PI), the objective function is separable in n while the constraint is not. Therefore, the Lagrange dual 
decomposition method, which has been applied in prior works (see, e.g., [13]-[18]), is also proposed here 
to decouple the constraint in n, and thereby decomposes (PI) into a set of N independent subproblems 
each for a different subcarrier. First, the Lagrangian of (PI) is written as 

l N 1 N 

£({Pi,n},A) = j-f^2r 1)n (p 1>n ) - A(— ^Pi, n - Pi) (5) 

n=l n=l 

where A is the non-negative dual variable associated with the power constraint. Then, the Lagrange dual 
function of (PI) is defined as 

g(X) = max £({pi,„}). (6) 

Pl,ra>0,Vn 

The value of the dual function serves as an upper bound on the optimal value of the original (primal) 
problem, denoted by r*, i.e., r* < g(X) for any A > 0. The dual problem of (PI) is then defined as 
minA>o(?(A). Let the optimal value of the dual problem be denoted by d*, which is achievable by the 
optimal dual solution A*, i.e., d* = <?(A*). For a convex optimization problem with a strictly feasible 
point, the Slater's condition [26] is satisfied and thus the duality gap, r* — d* < 0, is indeed zero for (PI). 
This result suggests that (PI) can be equivalently solved by first maximizing its Lagrangian to obtain the 
dual function for some given dual variable A, and then solving the dual problem over A > 0. 

Consider first the problem for maximizing the Lagrangian to obtain the dual function g(\) for some 
given A. It is interesting to observe that g(X) can be rewritten as 

1 N 

71=1 

where 

g n {\) = max r ln {p ln ) - \p ln n = l,...,N. (8) 

Pl,n>0 

By this way, g(X) can be obtained via solving a set of N independent subproblems, each for a different 
subcarrier n. Note that the maximization problems in © at different n's all have the same structure and 
thus can be solved using the same computational routine. For conciseness, the index n is dropped in ([8]) 
and the resultant problem is re-expressed as 

(P3) max a(p{) := ri(pi) - Api 

pi>0 
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where r\(pi) is given by © with the index n dropped. 

Solutions of (P3) for all the subcarriers can then be used to obtain the dual function g{X) in © 
for any given A. Then, the dual function needs to be minimized over A > in the dual problem to 
obtain the optimal dual solution A* with which the duality gap is zero, i.e., the original problem (PI) is 
equivalently solved. The standard routine in convex optimization to iteratively update A toward its optimal 
solution is via the bisection method [26] based on the subgradient of g(X), which can be shown to be 
Pi ~~ ji X^Li Pi,n- When A = A*, the associated optimal solution of (PI), denoted by {pl n }, satisfies 
j?J2n=iP*n = Pi- F° r brevity, the details of this standard routine are omitted here. 

Next, the solution of (P3) is derived for some given A. Note that since r\(jp\) is a concave function of 
Pi, so is a(pi) and thus (P3) is a convex optimization problem. The following discussions are then made 
on the solution to (P3): 

If r 2 < Gihixp-i), from © it follows that OMD should be applied in this case. Note that p t h given in © 
satisfies p t h > in this case, and a(pi) is the minimum of two functions defined as f\(pi) = f(pi) — Xpi 
and h\(pi) = h(pi) — Xpi, where f(pi) and h(px) are defined earlier in Section |nij Also note that when 
Pi < Pth, a(pi) = /a(pi); otherwise, a(pi) = h x (pi). The optimal values of p\ that maximize f\(pi) and 
h\(pi) can be obtained as the standard WF solutions 



with (■)+ = max(0, ■) and 



Pl " ( (ln2)A h u } (1U) 



respectively. Note that < p^' < pj . Let a* denote the optimal value of (P3), which is achievable by 
the optimal solution p\, i.e., a* = a(p\). Since max Pl min(/x(pi), h\(pi)) < mm(f\(p[^),h\(p[ h ^)), it 
follows that a* < f\(p[ ) and a* < h\(p { { ) ). Based on this result, p\ is obtained for the following three 
cases: 

• Pth > p[^'- In this case, a(jp^p) = fx(Pi), thus it follows that a* > f\(p[^). Since it has been 
shown that a* < f\(p[^), it follows that a* = f\(p\ ) and p\ = p±, as shown in Fig. [U(b). Note 
that successive decoding is optimal in this case. 

• Pth < P^'- Similar to the first case, it can be shown that a* = h\(p^) and thus p\ = p^ \ as shown 
in Fig. Q] (d). Note that joint decoding is optimal in this case. 
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• Pi^ 1 < Pth < Pi - Since p t h < p[ , it follows that f\(pi) is an increasing function for pi < p t h- 
Moreover, since a(p x ) = f x (pi), for p x < p th , it follows that f x (p th ) > a(pi) for any p x < p th . 
Similarly, it can be shown that h x (p th ) > a(pi) for any p 1 > p th . Since h x {p th ) = fx(Pth), it 
concludes that p\ = p t h, as shown in Fig. \T\ (c). In this case, either successive decoding or joint 
decoding achieves the optimum, while this paper adopts the former due to its more implementation 
ease over the latter. 

If r 2 > C(/i 2 ip 2 ), SD should be used. Note that p th < in this case. It is easy to show that the optimal 
solution p* of (P3) in this case is same as pp in (flOl) obtained earlier. Note that this WF-based power 
allocation policy is also used in IWF 

By summarizing the above discussions, the following theorem is obtained: 

Theorem 4.1: The optimal solution of (PI) at subcarrier n, n = 1, . . . , N, is (with the index n dropped 
for conciseness) 

/if) if) 
Pi, Pth > Pi 

* I Pth, p[ h) <Pth <p[ f) 

Pi, < pth < p\ 

{ P [ h \ Pth < o 

where p th is given in ©, while and p^ are given in © and (flOl) . respectively, with A = A*. The 
corresponding optimal decoding methods at subcarrier n are (from top to bottom) successive decoding, 
successive decoding, joint decoding, and SD, respectively. 

In Fig. [2l the optimal power allocation p\ in (flTT) at a particular subcarrier n is shown for different 
values of A*. Note that A* is a decreasing function of user'l average power constraint Pi. Only the 
case of r 2 < C(/i 2 ip 2 ) where OMD should be applied is considered here. Thus, p th > and only the 
first three expressions of pi in (fTTI) are illustrated in this figure. It is observed that the obtained power 
allocation is a variation of the standard WF solutions, e.g., p[^ in © and p^ in (flOl) . There are two fixed 
noise levels = 1/hn and = (1 + /i 2 ip 2 )//hi, corresponding to the power allocations p^ and 
Pi , respectively. The amount of power (water) to be allocated (filled) then depends on the water-level 
l/((ln2)A*). If Pi is sufficiently large such that l/((ln2)A*) > w^> and at the same time Pi is sufficiently 
small such that l/((ln2)A*) < +p th , then p\ = l/((ln2)A*) - = p[ f) ; if Pi is sufficiently large 
such that l/((ln2)A*) > + p th , but not yet large to make l/((ln2)A*) > + p t h, then p\ = p th 
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regardless of A* and the resultant noise-plus-power level is below the water-level l/((ln 2)A*)1£| if P\ is 
sufficiently large such that l/((ln2)A*) > w {s) + p th , then p\ = l/((ln2)A*) - w (h) = p[ h) . The above 
three cases are illustrated by Fig. [2] (a), (b), and (c), respectively. 

B. Carrier Joint Encoding 

Next, the problem (P2) for the case of CJE is studied. Similar to the case of CTE, it can be shown 
that Rf JE ({pi,n}) in © is a concave function of {pi, n } and thus (P2) is a convex optimization problem. 
Similar to (PI), the Lagrange duality method is applied to solve (P2). Like (PI), the Lagrangian and the 
dual function for (P2) can be obtained, and it can be shown that (P2) has a zero duality gap. For brevity, 
these details are skipped here and the min-max form of (P2) is directly given as 

1 N 

min max R° JE ({pi,„}) - //(— YV„ - Pi) (12) 

n=l 

with /i denoting the non-negative dual variable associated with the transmit power constraint. The optimal 
dual solution of fi, denoted by fi*, in the above minimization problem can be similarly obtained by the 
bisection method as in (PI). In the following, the maximization problem in (fT2l) over {pi in } with some 
fixed fi is addressed, which can first be simplified as (by removing the irrelevant constant term) 

(P4) max 6(KJ) := R? m ({pi, n }) ~ M%i,n]- 

Pl,n>0,Vn 

Similar to (P3), the following two cases are studied for (P4): 

If i?2 JE < E[C(/i2i,nP2,n)] s it is known from © that OMD should be used in this case. Compared 
with the previously studied case of CIE, the power optimization in the case of CJE is more involved, 
as explained as follows: From ©, it is easy to show that if i?^ JE < E[C(/i 2 i,nP2,n)]> Ri JE ({Pi,n}) can 
be expressed as the minimum of two functions defined as / M ({pi, n }) — E[C(/in )n pi n )] — /^Efpx n] and 
K({pi, n }) - E[C(h lhn p hn + h 2 i, n P2,n)] ~ R§ JE - fJ^\pi,n]> Then, let f^nip^n) - C(hn,nPl,n) - HPl,n 



and h^ n (pi )fl ) = C(h llin p ltTl + h 2 i, n P2,n) ~ R% ~ P-Pi,n, n = 1, . . . , N, be the component in and h 



at subcarrier n, respectively, i.e., = E[/ MiTl ], = E[h^ n }. Since min(/ At , h^) is not necessarily equal 
to E[min(/ M n , h^ n )], it is unclear whether Rf JE ({pi, n }) is separable in n, which makes unclear whether 
the maximization of b({p 1>n }) over {pi, n } is solvable directly by the dual decomposition method. 

5 It is noted that in realistic multi-carrier systems, the channel conditions vary from subcarrier to subcarrier and as a result it is unlikely 
that all the subcarriers will fall into this case and are thus allocated powers pth{n)'s regardless of A* or Pi. 
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Let b* denote the maximum value of &({pi, n }) achievable by the optimal solution {p 1;n }. Note that 
both / M ({pi, n }) and fo M ({pi, n }) are concave functions in {pi, n } and achieve their respective maximum 
values at 

^=((i^-i )+ ' n=1 -- w (13) 

Note that^] > pW,Vn. Since 6({pi, n }) = min(/ M ({p lin }), MKn}). it follows that 6* < /„({p$}) 
and 6* < Next, the following cases are discussed on {p ln }: 

. E[C( h l^ f) )} > i? 2 CJE : In this case, &({pK» = / M ({pg}). Since 6* < / M ({pg}), it follows that 

fr* = //i({Pi^}) anc ^ mus Pin = Priv Note that successive decoding is optimal in this case. 
. E[C7( -ffi=gfa )] < it£ JE : In this case, &({pg}) = Mfc®)- Since 6* < ^({pg}), it follows 

that 6* = /v({Pi^}) and thus pj = p^ n . Joint decoding is thus optimal. 
• nC(-^m)] < R? E < E[C7(-^%)]: In this case, {„•„} is neither {p$} nor {pg}. 
Furthermore, by contradiction it can be shown that E[C( 1 ^ 1 i '" p ^" )] = i?2 JE must hold in this case. 
Thus, {p* n } can be obtained by solving either one of the following two equivalent problems: 

(P5) max E[C(/i n ,nPi,n)] - /^[pi,n] 

pi,n>0,Vn 

B . t . E [C( /l2 )' nP2 ' ra )] > ^ JE . 
(P6) max E[C(/in,„pi,„ + h 2 i,nP2,n)] - i*2 JE - A*E[pi,„] 

Pi,n>0,Vn 

s .t. E[C( /l2 )' wP2 ' ra )] < i? 2 CJE 

1 + fill,nPl,n 

Note that the objective functions of (P5) and (P6) are both concave in {pi, n }. However, (P5) is a 
non-convex optimization problem since its constraint is not necessarily convex due to the fact that 
C(j^) is a convex function of x for x > with any positive constants a and 6, while (P6) is 
a convex optimization problem since its constraint has the reversed inequality of that in (P5) and 
is thus a convex constraint. Therefore, without loss of generality, (P6) is considered for this case, 
while the obtained solution is optimal for both (P5) and (P6). Similar to the third case of (P3), 
both successive decoding and joint decoding achieve the maximum rate given the optimal power 
allocations, whereas the former is more preferable than the latter from an implementation viewpoint. 
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Lemma 4.1: The optimal solution of (P6) is 



n i i+fei ,nP2,n ^ r\ 

p(h) _ J U ' (In 2)^(0) ^ - U ^5-) 

x*, otherwise 



for n = 1, . . . , N, where x* is the unique positive root of the equation 



while F n (x n ) is defined as 



x _ 1 1 + h 2 l,nP2,n ,j 6 . 

(In2) fiF n (x n ) h u>n 



FJx n ) = 1 + hn ' nX , n (17) 

1 + h 1X „X n + vh 2 \, n P2,n 



and v > with which the constraint of (P6) is satisfied with equality. 

Proof: Please see Appendix HI ■ 
It is observed from (fT5l) and (fT6l) that the optimal solution of (P6) resembles a biased version of the 
standard WF solution {Pi„} given in (fl4l) because the associated water-level is biased by an additional 
factor F n , which itself is a function of the optimal power allocation. It is also observed from (fTTT) that the 
biasing factor is an increasing function of the allocated power. The algorithm that resolves the biasing 
factor F n (x n ) to obtain the solution of x n in (fT6l) is given in Appendix HO 

If i?2 JE > E[C(/i 2 i, n P2,n)]> from © it is known that SD should be applied at user l's receiver in this 
case and Rf JE ({pi, n }) = ^[^( i^f^f )]> which is separable in n. Thus, &({pi )fl }) is also separable in 
n and can be maximized independently over different n's. It is not hard to show that the optimal power 
allocations {pi n } in this case are equal to {pi „} given in (fl4)) . Note that the power allocation policy 
(fl"4l) is same as (fTOl) . which is used in IWF. Also note that the achievable rate of IWF is same with CIE 
or CJE. 

Summarizing the discussions on the above two cases, the following theorem is obtained: 
Theorem 4.2: The optimal solution of (P2) is 



Pl,n 



( r> U) 

Pl.ni 


E[C( 


~{h) 
Pl,m 


E[C( 


v {h) 

Pl.ni 


E[C( 


\ Pl.ni 


dCJE 
K 2 



l+hu, n p[ f l 



h21 - nP2 - n M < i?cJE < fe21 '" P2 ' ( "M )] 

1 + Zlll.nPi _ 



" 21 '" P2 '" " < ^ JE < E[C(h 21 , n p 2 , n )} 



-i . T (A) 

l+reil,nPi n 
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for n — 1, . . . , N, where p[{ ' n , p[ and p^l are given in (fl~3l) . (fl~4l) . and (fl"5l) . respectively, with = /i*. 
The corresponding optimal decoding methods are (from top to bottom) successive decoding, successive 
decoding, joint decoding, and SD, respectively. 

V. Simulation Results 

In this section, the performance of the proposed ISS algorithm with OMD is evaluated and compared 
to that of the conventional IWF algorithm with SD. It is assumed that the multi-carrier system has the 
number of subcarriers N = 64 and the CP period is equal to 1/4 of the symbol period. All the channels 
involved in the system, including users' direct channels and interference channels, are assumed to each 
have 16 independent, equal-power, multipath taps. In addition, a symmetric channel model is assumed 
where the two users' direct channels have the same average unit power, and the two interference channels 
between users have the same average power denoted by p, while p may take different values in order 
to investigate the effect of the interference between the two users on their achievable rates. In total, 
1000 independent channel realizations are simulated over which each user's achievable average rate is 
computed, while the rate loss due to the insertion of CP is ignored. For each channel realization, the 
multipath taps of the direct/interference channels are generated by independent CSCG RVs with zero 
mean and equal variance. The ISS/IWF algorithm is then implemented over each channel realization 
where the two users iteratively update their power allocations until their rates both get converged. 

In Fig. HI the achievable average sum-rate of the two users is shown for different values of the 
interference channel power gain, p. It is assumed that P\ = P 2 = 100. It is observed that the proposed 
ISS algorithm with either CIE or CJE improves the sum-rate over IWF, thanks to the more superior OMD 
over SD. It is also observed that the achievable sum-rate of IWF fluctuates over different values of p, 
while ISS ensures a consistent rate increase with p except the region of very low values of p where OMD 
is not frequently applied. Interestingly, it is observed that as p increases, ISS with CJE becomes superior 
over that with CIE in terms of the achievable sum-rate. Since CJE has a lower complexity to implement 
than CIE, this result provides a useful guidance for practical system design. However, this phenomenon 
is some counter-intuitive since CIE provides each user more flexibility for rate adaptations over different 
subcarriers and is thus expected to be more suitable than CJE to exploit the benefit of OMD. A reasonable 
explanation for this observation can be obtained by looking at a snapshot of the users' converged power 
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spectrums in this case, as shown in Fig. [5] for p = 10. It is observed that the two users' power spectrums 
in the case of IWF are close to be orthogonal in frequency, which suggests that "interference avoidance" 
is probably the expected solution by IWF in this case. In contrast, the power spectrums of the two users 
in the case of ISS with CIE are observed to be almost overlapped in frequency, as a result of OMD being 
applied at different subcarriers, while the spectrums in the case of ISS with CJE appear to be in between 
those of IWF and ISS with CIE. It is thus conjectured that neither completely orthogonal nor overlapped 
spectrum is the best converged solution for decentralized spectrum sharing, which could probably explain 
why ISS with CJE performs the best when the interference channel gains are large. 

In Fig. [6} the achievable users' individual rates are shown for a special case of the general channel 
model studied in this paper. In this case, a "cognitive radio" type of newly emerging wireless system is 
considered, where user 1 is the so-called primary (non-cognitive) user (PU) that is the legitimate user 
operating in the frequency band of interest, while user 2 is the secondary (cognitive) user (SU) that 
transmits at the same time over the same spectrum under the constraint that its transmission will not 
cause the PU's QoS to an unacceptable level. Note that a similar scenario has also been considered in 
[27]. The PU is non-cognitive since it is oblivious to the existence of the SU and, thus, it applies the 
conventional IWF algorithm with SD by treating the interference from the SU as additional noise. While 
for the SU, it is cognitive in the sense that it is aware of the PU and thus transmits with a much lower 
average power than that of the PU in order to protect the PU. In this simulation, it is assumed that 
Pi = 100 and P 2 = 1- In addition, since the SU is cognitive, it may choose to use the more advanced 
resource allocation scheme, e.g., ISS with OMD instead of IWF with SD. Two cases are then studied in 
this simulation: Case I, both user 1 and user 2 employ IWF; Case II, user 1 employs IWF while user 2 
employs ISS. Note that in both cases, CJE is assumed for both users since the PU, with no knowledge 
on the existence of the SU, should use CJE instead of CIE from a practical consideration. In Fig. [61 it 
is observed that the achievable rate of user 1 (the PU) drops slightly in Case II as compared to Case 
I when p is sufficiently large, while the achievable rate of user 2 (the SU) improves significantly. For 
example, at p = 1, user l's rate drop is only 3% (a negligible rate loss), while user 2's rate improvement 
is as large as 140% (a dramatic rate increase) by comparing Cases I and II. 

The above observations can be explained by looking at a snapshot of both users' converged power 
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spectrums (normalized by users' respective average powers) at a typical value of p = 5 dB, as shown in 
Fig. [7J It is observed that user l's spectrum does not change much over the two cases, while user 2's 
spectrum changes dramatically from a very "peaky" one in Case I to a more spread one in Case II. The 
SU's rate improvement in Case II over Case I is due to OMD, which removes the effect of the PU's 
interference and thus the SU can allocate powers based on its own channel condition, while the PU's 
rate drop in Case II over Case I is due to the "interference diversity" phenomenon [28], namely, the 
more peaky interference in Case I is more advantageous for minimizing the resultant PU's rate loss as 
compared to the more spread one in Case II. 

VI. Concluding Remarks 

This paper studies a new decentralized resource allocation scheme, ISS, for multi-carrier-based mul- 
tiuser spectrum sharing. ISS maintains the main advantages of the well-known IWF algorithm, e.g., 
being purely distributed and requiring only practical channel knowledge, while it improves over IWF by 
exploiting OMD at the user receiver. The resultant benefits are twofold: First, OMD improves the user 
transmit rate at each iteration of resource adaptation as compared to SD; Second, ISS with OMD leads 
to more balanced converged user power spectrums than IWF with SD. 

This paper presents the very initial results on ISS, for which many issues remain unaddressed yet and 
are worth further investigating. First, it is shown by simulation that for ISS, CJE performs better than 
CIE with large interference channel gains, while the opposite is true for moderate or small interference 
channel gains. This observation raises the question on whether there exists an optimal multi-band encoding 
scheme that divides the total bandwidth into multiple sub-bands over which CIE is applied while within 
each sub-band CJE is applied. Second, simulation results verify that the convergence of ISS, like IWF, 
is always guaranteed with realistic channel realizations, while characterizing the exact conditions for the 
convergence of ISS is an important topic for the future study. Last, extending the results of this paper to 
the cases with more than two users and/or multi-antenna terminals will also be interesting. 

Appendix I 
Proof of Lemma [4TI 

Since (P6) is a convex optimization problem, the Lagrange dual decomposition method can be applied 
to solve it, similar to that for (PI). Let v be the dual variable associated with the constraint of (P6). Since 
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it is already known that for the problem of interest the constraint is satisfied with equality, it follows that 
v > from the Karush-Kuhn-Tucker (KKT) optimality condition [26]. Then, (P6) can be written as the 
following equivalent min-max optimization problem: 

min max E[C(h n> nPi,n + h 21>n p 2 , n )} - R^ E - fjE\p 1<n ] - u(E[C( - h21 ' nP2 ' n )] - R$™) (1 9 ) 

^>0 pi,„>0,Vn 1 + Aiil,nPl,n 

where the "min" part can be solved by the bisection method similarly like (PI), while the "max" part 
for some given v can be decomposed into N subproblems each for a different subcarrier. At subcarrier 
n, the associated subproblem is expressed as 

max C{h xl , n p ln + h 2 i, n P2,n) ~ HPi, n ~ vC { \ h21 ' nP2 ' n ) (20) 

P1,»>0 1 + hn,nPl,n 

Let 8 n be the non-negative dual variable associated with the constraint p ln > 0. The KKT optimality 
conditions for the optimal primal and dual solutions of the above problem, denoted by p* l n and 8*, 
respectively, are then obtained as 

* 1 1 h'21,nP2,n * r* n * \ n r* \ n 

= Mi^wiiy h^— ' p '^» = £ °' a » £ 

where F n (-) is given in (TT7T) . From the above KKT conditions, by considering the following two cases: 
(1) 8* > 0, p* = 0; and (2) p\ n > 0, 8* = 0, dHJ) can be correspondingly obtained. 

Appendix II 
Algorithm to Solve COD) 

The algorithm to obtain the unique positive root x* of the equation (fT6l) is given in this appendix. 
Define C7 n (a; n ) = l/((ln2)//F n (x n )). Note that G n is a decreasing function of x n for x n > 0, and 
G n (0) > ( n = (1 + h 2 i, n P2,n)/hu,n fro m (fl5T) . and C7„(oo) = l/((ln2)/x). As shown in Fig. [3l x* is 
then obtained as the intersection between a 45-degree line starting from the point (0, ( n ) and the plot 
of the function G n (x n ) in the region of x n > 0. Numerically, x* n can be obtained by a simple iterative 
algorithm based on the bisection search described as follows. Let x* e [0,x™ ax ], where x™ ax is an upper 
bound on x* n . A proper value of x™ ax may be G n (0) — ( n from Fig. [3] For the first iteration, let x n be the 
midpoint of the initial interval for x* , i.e., x n = |x™ ax . The value of G n (x n ) — ( n is then computed, and 
compared to x n : if it is larger than x n , it follows that x* > x n and thus x* E (^x™ ax , x™ ax ]; otherwise, 
< a^n an d x* G [0, |x™ ax ]. Thereby, after the first iteration, the interval for searching x* is reduced 
by half. The above process is repeated until x* is found within any given accuracy. 
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(a) (b) 




Fig. 1. Illustration of the functions ri(pi) and a(pi) = ri(pi) — \pi in the case of r2 < G(hnP2)- Sub-figure (a) illustrates the function 
n(pi); sub-figures (b), (c), and (d) illustrate the function a(pi) = min(/A(pi), h\(p\)) for A = 0.65, 0.5, and 0.4, respectively, where the 
function's maximum value is achieved by pi = p[ ,pth> an d p[ , respectively. 
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Fig. 2. Illustration of the optimal power allocation i ll It in the case of r 2 < C(hih 2 ): (a) w (f) < < +p th \ (b) w u) +p th < 

, ln < + pth\ and (c) , ln ^ A , > vP^ + Pth- The height of the grey area in each case is the corresponding allocated power. 




Fig. 3. Illustration of the unique positive root x* n for the equation d!6t . 
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The achievable sum-rate versus the average interference channel power gain p between the users for Pi 



= 100. 
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Fig. 5. A snapshot on the converged user power spectrums in the case of P\ = P2 = 100, and p = 10. 
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Fig. 6. The achievable user rates versus the average interference channel power gain p for Pi = 100 and P2 = 1. 




Fig. 7. A snapshot on the converged user power spectrums in the case of Pi = 100, P2 = 1, and p — 5 dB. 



